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Abstract —The idea of opposition-based learning was intro¬ 
duced 10 years ago. Since then a noteworthy group of researchers 
has used some notions of oppositeness to improve existing opti¬ 
mization and learning algorithms. Among others, evolutionary 
algorithms, reinforcement agents, and neural networks have 
been reportedly extended into their “opposition-based” version 
to become faster and/or more accurate. However, most works 
still use a simple notion of opposites, namely linear (or type- 
I) opposition, that for each x e [a, b] assigns its opposite as 
xi = a b — X, This, of course, is a very naive estimate of the 
actual or true (non-linear) opposite xn, which has been called 
type-II opposite in literature. In absence of any knowledge about a 
function y = /(x) that we need to approximate, there seems to be 
no alternative to the naivety of type-I opposition if one intents to 
utilize oppositional concepts. But the question is if we can receive 
some level of accuracy increase and time savings by using the 
naive opposite estimate xj according to all reports in literature, 
what would we be able to gain, in terms of even higher accuracies 
and more reduction in computational complexity, if we would 
generate and employ true opposites? This work introduces an 
approach to approximate type-II opposites using evolving fuzzy 
rules when we first perform “opposition mining”. We show with 
multiple examples that learning true opposites is possible when 
we mine the opposites from the training data to subsequently 
approximate xu = /(x, y). 

I. What is the Problem? 

It was 10 years ago that “opposition-based learning” 
(OBL) was born. Since then, a modest but growing commu¬ 
nity of researchers has tried to use OBL to improve diverse 
optimization and learning techniques. Evolutionary algorithms, 
reinforcement agents, swarm-based methods, and neural net¬ 
works, to mention a few, have been extended to employ op¬ 
positeness in their processing. Evaluating the error of weights 
and opposite weights, examining the rewards for actions and 
counter-actions, and examining the fitness of chromosomes 
and anti-chromosomes are examples for rethinking existing 
concepts by embedding opposite entities. Apparently, as a re¬ 
view of literature easily illustrates, using opposites can actually 
help accelerating many optimization and learning processes 
(see section The main benefit of using opposites seems 
to be contributing to an accelerated convergence. Apparently 
by simultaneous consideration of entities and opposite entities, 
algorithms become capable of jumping over large portions of 
the solution landscape for difficult problems when they do not 
exhibit any significance. 

Since the first paper was published on OBL in 2005 1321 . 
it has not been really clear how oppositeness can actually be 
captured with respect to the intrinsic behaviour of the problem 
at hand. The proposed scheme was that for any given x e [a, 6] 
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the opposite x can be given via x = a ^ b — x. This, if at 
all, only makes sense for “linear” functions, in which case 
no particular algorithmic sophistication is required to begin 
with. But in spite of its simplicity, many works have reported 
benefits for using x = a ^ b — x, which is called “type I 
opposite” in context of opposition-based computing 1^ . 

After 10 years, some questions are still unanswered: Is 
there such thing as type II opposites? If they exist, how can 
we calculate them? And most importantly, would using type II 
opposites bring any benefit to existing machine-learning and 
optimization methods? 

11. The Idea 

What we call type II (or true) opposite of x, denoted with 
Xu, is supposed to be meaningful for “non-linear” mappings 
and relationships, in contrast to type I opposites x^ that latently 
assume a linear relationship between in- and outputs. 

Looking at the function y = /(xi, X 2 ,..., Xn) in a typical 
machine-learning or optimization context, one is generally 
fortunate to receive the output values y for some input variables 
xi,X 2 ,..., However, the function y = /(•) itself is 
generally unknown otherwise there would be little justification 
for resorting to machine intelligence. Instead, one has some 
sort of evaluation function g{-) (error, reward, fitness etc.) that 
enables us to assess the quality of any guess f i, ;r 2 ,..., 
delivering an estimate y of the true/desired output y. 

The idea proposed in this paper is to use training data, 
whenever available, to perform opposition mining in order 
to approximate type II opposites gradually by learning the 
X, X//-relationship, or more precisely by learning xn = 
f{x^y). Of course, if a large number of training data is 
available, then one could also just apply them at once instead 
of perpetual/continuous change. 

Eor every input x, in contrast to the type I opposites which 
are defined according to 

xi = a ^b — x^ (1) 

we propose to use the more meaningful type II opposites 
according to 

^min H“ ^max (^) 

There are several challenges for such an approach: I) the range 
of the output y = /(•) may not be a-priori known, hence 
we may need to continuously (in an evolving manner) update 
our knowledge of the output range [^min, ^max], 2) the exact 
output for a type-II opposite may not be present in data. Hence, 
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the approach should be capable of improving its estimation 
as data stream provides more clues about the gaps in our 
estimate of the in- and output relationship, and 3) sufficient 
(representative) data may not be available at any given time; 
another reason in favour of an evolving approach. 

In this work, we put forward a concise algorithm to learn 
opposites via fuzzy inference systems (FIS) with or without 
evolving rules depending on application whereas the evolving 
fuzzy systems may offer more versatility in many challenging 
applications (section |IV]). Before that, we briefiy review the 
existing literature in section m Experiments and results, to 
verify the correctness and the usefulness of the proposed 
approach are described in section |V] 

III. Reviewing Previous Works 

In this section, we briefiy review the existing literature on 
opposition-based learning and the evolving fuzzy systems. 

A. Opposition-Based Learning 

Opposition-based learning was introduced in 2005 (321. 
Motivated by finding an alternative for random initialization, 
it was observed that for many algorithms we start with a 
random guess and hope to move toward an existing solution in 
a speedy fashion. Examples were named such as the weights 
of a neural network, the population in genetic algorithms, and 
the action policy of reinforcement agents. Eurther, the paper 
stated that if we begin with a random guess far away from the 
existing solution, “let say in worst case it is in the opposite 
location, then the approximation, search or optimization will 
take considerably more time, or in worst case becomes in¬ 
tractable”. The paper advocated that “we should be looking 
in all directions simultaneously, or more concretely, in the 
opposite direction” in order to have a higher chance of finding 
a solution in a shorter time, and established the following 
definition (321: 

Opposition-Based Learning - Let f{x) be the function in 
focus and g{-) a proper evaluation function. If x G [a, 6] is 
an initial (random) guess and x is its opposite value, then 
in every iteration we calculate f{x) and f{x). The learning 
continues with x if g{f{x)) > g{f{x)), otherwise with x. The 
evaluation function ^(•), as a measure of optimality, compares 
the suitability of results (e.g. fitness function, reward and 
punishment, error function). 

Of course, after many reports in the past 10 years we 
know that x should not necessarily be a random guess at the 
beginning, but it can be a calculated or estimated value by a 
learning or optimization algorithm at any stage of the process. 
As well, we may not necessarily need to evaluate g{f{x)) and 
g{f{x)) in all iterations. 

Rahnamayan et al. introduced opposition-based differential 
evolution (ODE) which seems to be one of the most successful 
applications of OBL via integration within an existing algo¬ 
rithm insi-Ea. Among other early works, there are papers on 
the application of OBL for reinforcement learning (33l, (^ 
where “opposite actions” were defined for specific problems, 
hence, not engaging in question of type-I versus type-II op¬ 
posites. Defining opposite fuzzy sets, which may be mistaken 
with “antonyms” and their applications have been introduced 


too ii, Ea, Tizhoosh and Ventresca (35]| presented 
a complete collection of oppositional concepts in machine 
intelligence in an edited volume, in which the term “type- 
II opposition” was coined in the chapter opposition-based 
computing. Ventresca and Tizhoosh also extended simulated 
annealing (38]| and introduced the notion of opposite transfer 
functions for neural networks (^ . 


Surveys are available to provide an overview of OBL 
methods. Eor instance, Al-Qunaieer et al. m provide a com¬ 
pact survey of oppositional methods. The most comprehensive 
survey of opposition-based methods so far has been published 
by Xu et al. (40l . 


Type II opposites have not been examined in-depth. How¬ 
ever, as we will briefiy review them in the following two 
paragraphs, recently two papers have independently proposed 
different possibilities for estimating type II opposites. 

Mahootchi et al. ca provided some new ideas for type-II 
opposites when dealing with reinforcement learning methods 
such as Q- and sarsa learning. Looking at actions a that 
an agent could take to manipulate its environment, the type- 
I opposite action d was originally defined as ‘directional’ 
opposite for grid-world agents, hence “up” was the opposite of 
“down”, for instance. However, in a stochastic environment it 
makes sense to examine the discounted accumulated rewards 
Q{s, a) for each state s and estimate the opposite as follows: 


dG < d\Q^{i, d) ^maxQ^{i,b) — Q^{i, a) > 

(3) 

To obtain new knowledge about the opposites from observa¬ 
tions, the authors train an MLP network which is updated every 
several episodes. Eor higher accuracy, the approximate function 
could be updated online when the agent can interact with the 
stochastic environment. “In other words, after the first training 
of the network, two sets of action-value functions exist, with 
the first being used to find the action that the agent should take 
and the approximate one being used to extract the opposites. 
This means that there is a mutual relationship between the 
agent and the opposite agent. This process may assist each 
agent to accomplish its tasks more efficiently in terms of 
convergence time and accuracy in real-world applications” 
ca. The authors also reported that MLP may not be efficient 
for large applications due to its computational expense. They 
suggest that one may substitute MLP with other types of 
function approximations such as fuzzy inference systems. 


Salehinejad et al. (3Ql . examining a type-II extension of 
opposition-based differential evolution (ODE) also point to the 
fact that the true opposites should be calculated via f{x) = 
^min+^max~/(^) to focus ou the output but they use centroid- 
based method instead of depending on min and max specially 
where the landsc^e boundaries are unknown. A look-up table 
can help to find f{x), and if not present in the table, its value 
can be estimated via interpolation. In their algorithm, they use 
type-I and type-II opposition simultaneously. 


B. Evolving Fuzzy Systems 

An evolving fuzzy system has been initially introduced as 
an unsupervised method of updating the rule-based structure 
of a fuzzy inference system (LIS) in a non-iterative way (TJ, 
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El. The main idea is that FIS can be extended as more data 
becomes available. The fuzzy rule base evolves (is extended) 
by adding more rules to a basic rule set formed by the initial 
fuzzy model or by replacing existing rules with ones that 
approximate the data better. When data are captured online, 
its potential is calculated, and the potentials of existing cluster 
centers are recursively updated. The potential of the new data 
is compared with the existing centers and one of the following 
decisions is made: 1) Replace one or more existing clusters 
with the new data point if its potential is higher than a certain 
threshold and the new data point is close to an old center, 2) 
Add the new data as a new cluster if its potential is higher 
than a certain threshold. 

The term “evolving” may imply that instead of retraining 
the initial system, the system should be updated online when 
new data become available. However, the evolution of rules in 
a fuzzy inference system can occur in a variety of ways, and 
as long as the initial fuzzy model (rule set) is continuously 
updated, no preference exists with respect to how the actual 
update is performed. In m, the Takagi-Sugeno (T-S) type 
of fuzzy controller with an evolving structure was proposed 
for online identification of T-S systems. The fuzzy rules of 
this controller are expanded through the use of data collected 
during the control process, and is, thus, trained in a non¬ 
iterative (recursive) way. There have been approaches that 
use an iterative approach specially for the application where 
keeping and re-training the data does not pose a challenge ca. 
More recent approaches to evolving fuzzy systems have been 
proposed Elia. Evolving rules have been used for modelling 
nonlinear dynamic systems m and image classification and 
segmentation ifTOlirailTl . 

Comprehensive treatment and reports on new advances in 
evolving fuzzy systems can be found in literature ifTTI . (121, 
ca, ca. There are other learning approaches one could use. 
For instance, participatory learning ED where compatibility 
between observations and beliefs plays an important role. 

In this paper, we propose a general, simple and efficient 
approach to learn type-II opposites using fuzzy inference 
systems. The emphasis is on opposition mining (finding 
opposites or quasi-opposites in the available training data) and 
evolving rules (which perpetually adjust the approximation of 
the mined opposites). 

IV. Learning Opposites via Evolving Rules 


where Xi and y are variables defined in corresponding uni¬ 
verses of discourse Xi and Y, respectively, and Ai is a fuzzy 
(sub)set. The function /j(xi, X 2 , • • •, x^) is a crisp (nonfuzzy) 
function of xi. In general, the function fj is defined as the 
weighted combination of all variables /j(xi, * * ? ^n) = 

IC - 0 + WiXi + W2X2 H-h WnXn- 

The output is then calculated by 


y = 






(4) 


where N is the number of fuzzy rules, n is the number 
of inputs (features), fij is the membership value of the ith 
input Xi for the jth rule, 1 < < n, and T is a T-norm 

representing the logical conjunction. 

In order to learn the type II (true) opposites with evolving 
fuzzy rules, we first have to sample the problem at hand to find 
existing (quasi-)opposites. This is the opposition mining stage 
that provides the data for clustering and the subsequent rule ex¬ 
traction. One generally assumes that the more data is available 
the better the approximation becomes because we expect that 
more data increases the probability for opposition mining to 
find more representative samples for rule generation in order 
to more accurately approximate xu = f{x,y). We assume 
that the range of input variables is known, Xi G [^min^^max]’ 
but the range of output, yj e 2/max] > may be a-priori 

unknown. Since we are looking for the true (type-II) opposites, 
we need to calculate the (quasi-)opposite of the output for each 
sample. Subsequently, we find the closest value to that point 
and select its corresponding input as the opposite of the given 
input. If the inflow of data continues, the reliability of the 
mined opposites as training data increases which would result 
in higher accuracies for the approximated true opposites. 

As preparation for designing the learning procedure, let us 
look at Figure 

• Calculating type I opposites is straightforward (top 
diagram). For any given x we calculate its opposite 
X = a b — X (denoted with ox on the x-axis) 
which can then be used to estimate the value of f{x) 
(denoted with f(ox) on the y-axis). However, as it 
is apparent from the diagram, calculation of ox is 
completely detached from the output. 


A fuzzy inference system (FIS) generally consists of a set 
of IF-THEN rules of the following form: 

IF xi is Ai AND X 2 is A 2 AND • • • AND Xn is An 
THEN y is B 

where Xi^y G X are variables defined in corresponding 
universes of discourse Xi and Y, respectively, and Ai and B 
are fuzzy (sub)sets (in this work we investigate the problems 
with many inputs and one output only). These rules can be 
defined by experts. However, in most real-world applications 
they can be extracted from available data via clustering. A 
Takagi-Sugeno, or T-S fuzzy inference system operates with 
fuzzy rules of the following general form ED: 

IF xi is Ai AND X 2 is A 2 AND • • • AND a;„ is A„ 
THEN y = fjixi,X 2 ,...,Xn),j = 1,2,---,N 


• Type II opposites, in contrast, are based on a more 
realistic (or more intuitive) understanding of opposite¬ 
ness (bottom diagram in Figure [^. For any given x 
(0), we first need an estimate or evaluation of f{x) 
( 0 ). Then, we find the opposite of f{x), namely f{x) 
or of(x) in Figure [^(0). Any input that can produce 
outputs like of(x) is the type II opposite of x (oxl, 
0x2 and ox3 on the x-axis) ( 0 ). Of course, for non¬ 
monotonic functions, we may get multiple opposites 
for each input. In most cases, one of those opposites 
might suffice to exploit the potential benefits of OBL 
when validated with the available objective function. 

Algorithm [^describes the steps how we learn the opposites 
based on the same steps in the bottom diagram of Figure 
The approach consists of two distinct stages: 
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Fig. 1. Type I versus Type II Opposition. TOP: 1) variable x is given, 2) 
opposite of X, xo, is calculated via a+b-x . BOTTOM: 1) variable x is given, 2) 
f(x) is calculated, 3) opposite of f(x), namely of(x) is calculated, 4) opposite(s) 
of X are found: oxl,ox2 and ox3. 


Opposition Mining (Lines 7 to 24) - The training data 
is sampled to established the output boundaries. Depending 
on a specific oppositeness scheme, all data points are then 
processed to find the (quasi-)opposite of each input and the 
corresponding input xu, which is the type-II opposite. At the 
end of the opposition mining we have Ug opposites. 

We also look at different ways of calculating opposition 
in general. Given any function y = f{x) with variables x G 
[xmin^^max] with a Sample average of x, one may calculate 
oppositeness for the input domain in different ways 1^ . Ea, 
1211, EH (% denotes the modulo operator): 


Ti : 

XI 

— ^min T ^max ^ 

(5) 

T 2 ; 

XI 

f ^min T ^max ^ 

= (x+ J 

1 % ^max (6) 

T 3 : 

Xi 

= 2x — X 

(7) 


Analogously, given the output y G [^min, ^max] with an average 
y, one may calculate its opposite yj in different ways: 

• y — ^min “t“ ^max V (8) 

^2 : y = I ^ H-^- 1 % ^max (9) 

Ts : y = 2y-y (10) 

For Ts scheme, we may go out of the existing boundary of 
the variables sometimes. For the experiments, we solved this 
problem by switching to scheme Ti whenever necessary. It is 
paramount to emphasize that any opposite calculate based on 
^ is a type-II opposite. 


Learning the Opposites (Lines 26 to 30) - Here any 
learning and approximation method may be used to employ 
the results of opposition mining. As for our approach, namely 
the fuzzy inference systems (FIS), in- and outputs are clustered 
to extract rules. Performing fuzzy inference with these rules 
will then approximate the type-II opposites for new (unseen) 
inputs. 

In following section, we report the results of some experi¬ 
ments to verify the superiority of type II over type I opposites. 


Algorithm 1 Learning the opposites via Evolving Rules 

1 : -Initialization- 

2 : Set FIS mechanism, e.g. Takagi-Sugeno 
3: Determine the number of samples Ug 
4: Set the number of clusters Uc 
5: Select oppositeness scheme Ti,i G {1,2,3} 

6 : -Opposition Mining- 

7: Get sample points < xl,X 2 ,... ,y^ > (i = 1, 2,..., rig) 
8 : Calculate ymin, Pmax, and y 
9: for i = 1 : rig do 
10 : thisY ^ y{i)\ 

11 : Use Ti to estimate the opposite of the ouput: 

12 : Ti — T\. OppY i ^min T ^max thisY 

13: Ti = Ta: oppY ^ (thisY + % y^ax 

14: Ti = T 3 : oppY ^ 2y — thisY 

15: minDiff ^ oc (a large number) 

16: for j = 1 : rig do 

17: thatY ^ y{j) 

18: thisDiff ^ \oppY — thatY\ 

19: if thisDiff < minDiff then 

20 : minDiff ^ thisDiff 

21: Xll{i) ^ x{j) 

22 : end if 

23: end for 

24: end for 

25: -Train FIS- 

26: Set < , X 2 ,..., > as input 

27: Set < x\ jj^x\ ... > as output 

28: Partition the data into Uc clusters 
29: Extract fuzzy rules 
30: Save the FIS data 


V. Experiments and Results 

We conduct multiple experiments to demonstrate the use¬ 
fulness of opposition mining and learning opposites with 
evolving fuzzy rules. We examine 9 simple functions with 
known inverse functions to verify the correctness of the 
algorithm, and to establish a better understanding of the ap¬ 
proach. Using functions with known inverse relations is a very 
reliable testing vehicle because the true opposites can be easily 
calculated for validation. As well, we employed Matlab™ 
“genfis3” function to implement a clustering-based Takagi- 
Sugeno fuzzy inference system (note: Matlab™ “genfis3” 
was trained incrementally to achieve the rule evolution). We 
experimented with number of clusters Uc to be 30 or 60 
clusters (with relatively similar results) and set the number 
of maximum iterations to maxlter= 5000. The fuzzy exponent 
was set to m = 2 and the clustering error threshold was fixed 
at e = 0.00001. 
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A. First Experiment Series 

The results for 9 functions are summarized in Table |l| (also 
see Figure [^. The simplicity of the functions enable us to 
easily formulate the inverse function such that we can exactly 
calculate the true opposites. With exception of fjix), the 
inverse of all other test functions are given in Table [T| 


(§+( 

f- 

29^ 


\ 2 29^ 

V2 

54. 

1 729^ 

1 54 ) 



For every function 100 random samples were drawn in 
the opposition mining stage. The error for xj and xu is 
calculated through direct comparison with the true opposite 
X* available through inverse functions: error(x/) = |x* — xi\ 
and error(x//) = |x* — xii\. 

Analysis of the Results - 

• Type-II opposite is clearly more accurate than type-I 
opposite with exception of linear function f 3 {x). 

• The opposition scheme Ti seems to be the best repre¬ 
sentation of oppositeness. T 2 is the best oppositeness 
scheme only for fs{x). 

• If we only examine type I opposites, T 3 is better than 
Ti and T 2 for 7 out of 9 test functions. 



0 200 400 600 800 1000 1200 


Fig. 2. Examples for the test functions: f 2 {x) (top) and fsix) (bottom) 
according to equations provided in Table [I| The simplicity of the functions 
allows us to calculate the inverse (hence the true opposites) but they are still 
difficult enough to demonstrate the limitations of the type-I opposition as large 
errors in Table U demonstrate. 


B. The Second Experiments Series 

In the second experiment series, we looked at the effect 
of evolving rules: What happens if data is fed into the system 
one-by-one or block-wise? Intuitively, one expects that through 
clustering of more data, a more compact rule base can be 
generated which can more accurately approximate the true 
opposites. 

As Figure shows some representative examples, the 
average error and standard deviation of approximated opposites 
decrease over time as the estimated values by evolving rules 
become more and more representative of the true opposites. 
We also observed a typical pinnacle point at which the 
error reaches its maximum when one observes the evolving 
opposites long enough (Figure [^. This appear to be due to 
the intrinsic nature of evolving steps as at the beginning there 
are not enough data points to extract good rules, thus resulting 
in large errors. 

C. Third Experiment Series 

In this section, we test 3 standard optimization functions 
which are commonly used in the literature of global optimiza¬ 
tion. For our convenience and without any loss of generality, 
we use only 2D functions, two variables xi and X 2 . We learn 
the opposites with Ug = 1000 samples and then test the 
function optimization with 0.1 x new samples. Two random 
variables are generated and used to call FIS for their opposites. 
The combination of these two may provide a lower error as 
seen for the examined functions. Now, as we have established 
the superiority of evolved type II opposites over type I op¬ 
posites in previous sections, here we also examine type II 
opposites in conjunction with random samples (also see IZTl . 
II26I ). Here we would like to verify whether the fundamental 
Statement of OBL is true: Simultaneous consideration of guess 
and opposite guess provides lower errors at each iteration of 
learning and optimization processes. 

Given a function f{xi,X 2 ) = 0 we generate two ran¬ 
dom samples x\ and X 2 and we calculate the error. Then, 
we approximate the opposites x^ and X 2 and calculate the 
error. As generally proposed in OBL schemes, we can si¬ 
multaneously look at all results and continue with the best 
results depending on the evaluation of the function call. 
That means we continue with the results of x^ and X 2 if 
Evror{f{xi,X 2 ))<Err:or{f{xi,X 2 )) otherwise with x^ and x^. 
The opposites can be type I (as used in existing literature) 
or type II (as we are proposing). The question is which one 
can provide a statistically significant benefit when used in 
conjunction with the initial random guess x^. 

Ackley Function - The Ackley function is given as 

f{xi,X 2 ) = 20 ^1 - exp ^-0.2^0.5(a:;f - (12) 

exp (0.5(cos (27rxi) + cos (27rx2))) + exp(l). 

The global minimum is f{xi,X 2 ) = /(3,0.5) = 0 
where the search domain is — 35 < < 35. The results for 

Ackley function are given in Table |I^ The random guesses 
(first column) and their type I opposites (third column) were 
statistically the same (null hypothesis could not be rejected). 
Their simultaneous consideration therefore does not provide 
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TABLE 1. The results eor test eunctions (best results highlighted): For every eunction both Type I and Type II opposites are 

CALCULATED USING THE THREE DIEEERENT OPPOSITENESS SCHEME. THE AVERAGE ERROR m AND ITS STANDARD DEVIATION a IS CALCULATED USING 

DIRECT COMPARISON WITH THE KNOWN TRUE OPPOSITE VIA INVERSE EUNCTION. 



Oppositeness 

Type-I Opposites 

Type-II Opposites 

Function and its inverse 

scheme 

m ± cr for 30 runs 

m ± cr for 30 runs 

fi{x) = {2x + 8)^ 

Ti 

378.91 ± 178.48 

22.62 ± 24.01 

= -S)/2 

T2 

337.20 ± 303.62 

78.12 ± 96.86 


Ts 

234.27 ± 187.97 

155.25 ± 228.39 

f 2 (x) = log(a: + 3) 

Ti 

478.06 ± 270.07 

18.95 ± 29.47 

= exp(2/) - 3 

T2 

470.85 ± 296.33 

24.00 ± 51.03 


Ts 

529.12 ± 1881.12 

557.06 ± 2051.47 

f 3 (x) = 2x 

Ti 

0 ± 0 

0.02 ± 0.01 

f^^(x) = y/2 

T2 

0 ± 0 

210.66 ± 137.20 


Ts 

0.51 ± 2.90 

0.82 ± 0.53 

fi{x) = 

Ti 

285.91 ± 117.22 

21.77 ± 19.91 

= Vy 

T2 

248.87 ± 297.11 

118.34 ± 129.50 


Ts 

152.22 ± 129.68 

120.17 ± 185.21 

f^ix) = y/x 

Ti 

313.05 ± 140.76 

0.028 ± 0.018 

f^^{x) = y‘^ 

T2 

317.94 ± 335.52 

161.85 ± 156.19 


Ts 

96.79 ± 99.41 

101.09 ± 156.69 

fe(x) = x^^^ 

Ti 

183.72 ± 72.06 

15.38 ± 13.35 


T2 

171.61 ± 280.48 

150.75 ± 142.32 


Ts 

83.11 ± 73.16 

59.28 ± 98.56 

frix) = x^ x^ -\-l 
(for f~^(x) see Eq|nj 

Ti 

380.63 ± 177.82 

22.98 ± 26.77 

T2 

328.18 ± 297.96 

75.55 ± 93.71 

Ts 

131.57 ± 85.13 

130.22 ± 177.85 

fsix) = 1/x 

Ti 

503.26 ± 292.20 

3.46 ± 13.49 

fQ^(x) = 1/y 

T2 

496.60 ± 289.02 

0.56 ± 0.27 


Ts 

434.52 ± 338.03 

39.72 ± 163.92 

fg{x) = ^y(x + l)/3 

Ti 

306.25 ± 138.12 

0.03 ± 0.02 

f-\x) = 9y^-l 

T2 

299.37 ± 324.00 

159.08 ± 165.29 


Ts 

95.63 ± 92.46 

97.60 ± 147.43 


any benefit. In contrast, the type II opposites (second column) 
are not only different but also exhibit lower errors. 


TABLE 11. Errors eor Ackley Function 



Xi and X 2 

^1,/j ^ 2,11 

x^ j and X 2 j 

1. run 

284.89 ± 263.32 

10.35 ± 9.21 

284.89 ± 263.32 

2. run 

20.45 ± 2.39 

2.09 ±2.09 

20.45 ± 2.39 

3. run 

20.61 ± 1.50 

3.90 ± 0.41 

20.61 ± 1.50 

4. run 

20.24 ± 2.27 

2.40 ± 1.19 

20.24 ± 2.27 

5. run 

20.63 ± 1.71 

4.06 ± 1.01 

20.63 ± 1.72 


Booth Function - The Booth function is given as 

/(Xf, X 2 ) = {xi + 2X2 - 7)^ + (2xi + X 2 - 5)^. (13) 

The global minimum is /(xi,X 2 ) = /(1:3) = 0 where 
the search domain is —10 < < 10. The results for Booth 

function are given in Table |II^ Apparently, the random guesses 
(first column) and their type I opposites (third column) are the 


same distribution. Their simultaneous consideration therefore 
does not provide any benefit. In contrast, the type II opposites 
(second column) are different. The simultaneous consideration 
of guess and opposite guess does provide a plausible benefit. 

TABLE III. Errors eor Booth Function 



x^ and X 2 

^ 1,11 ^ 2,11 

x^ J and X 2 I 

I. run 

448.78 ± 460.46 

1930 ± 612.73 

432.54 ± 444.90 

2. run 

403.22 ± 425.78 

2000 ± 575.18 

400.88 ± 424.10 

3. run 

470.40 ± 556.03 

1875 ± 717.06 

374.51 ± 388.70 

4. run 

382.35 ± 405.19 

2103 ± 545.19 

400.07 ± 465.64 

5. run 

422.20 ± 483.02 

2058 ± 609.94 

415.63 ± 455.30 


Bukin4 Function - The Bukin4 function is given as 

f{xi,X 2 ) = lOO^lIrcz -O.Olrcfll +0.01||xi + 10||. (14) 

The global minimum is /(xi,X 2 ) = /(—10,0) = 0 where 
the search domain is —15 < xi < —5 and —3 < X 2 < 3. The 
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Fig. 3. The errors and standard deviations (std) for three different runs: In each run opposites are first trained offline with 100 samples (not shown here). The 
diagrams show the error of estimating the true opposites based on fi{x) (Table with 100 new opposites online whereas the rules are clustered and evolved 
with 101,102, • • •, 199, 200 samples, respectively. 


3.5 T 


3 



Fig. 4. The evolving opposites gradually become more accurate after processing each additional data point. The average error (black) and standard deviation 
(gray) decrease with each new sample. At the beginning there may be fluctuations due to lack of sufficient number of data to extract reasonable clusters and 
hence meaningful rules. Standard deviation may grow larger than mean if a small sample contains variables with large range. 


results for Bukin4 function are given in Table IV Apparently, 
the random guesses (first column) and their type I opposites 
(third column) are the same distribution (null hypothesis could 
not be rejected). Their simultaneous consideration therefore 
does not provide any benefit. In contrast, the type II opposites 
(second column) are both different and deliver lower errors. 


TABLE IV. Results for Bukin4 Function 



Xi and X 2 

^1,/j X 21 J 

x'^ j and X 2 I 

1. run 

290.14 ± 271.39 

1.75 ± 1.59 

290.14 ± 271.39 

2. run 

317.69 ± 253.78 

7.34 ±9.6 

317.69 ± 253.78 

3. run 

301.05 ± 301.05 

1.26 ± 1.03 

301.05 ± 289.75 

4. run 

299.18 ± 262.70 

49.06 ±34.26 

299.18 ± 262.70 

5. run 

282.59 ± 274.73 

6.43 ± 5.01 

282.59 ± 274.73 


As apparent from the three tables, the simultaneous con¬ 
sideration of random guess and its opposite, as already demon¬ 
strated in existing literature, has clear benefit as always one of 
them is much closer to the solution. This shows that the learned 
type-II opposites do satisfy the fundamental assumption of 
OBL, namely that the simultaneous consideration of a guess 
and its opposite delivers a shorter path to the solution. 


VI. Summary 

Ten years after introducing opposition-based learning, there 
exists no general algorithm to generate opposites for learning 
and optimization purposes. In this paper, taking the 50th an¬ 
niversary of fuzzy sets as a pretext, we introduced an approach 
to leam true (type II) opposites via evolving fuzzy inference 
systems. The core idea in this paper is “opposition mining” to 
extract (quasi-)opposites by processing the available training 
data. This makes the learning of opposites possible as it 
extracts the necessary data (specially the desired outputs) 
for any learning scheme to approximate. The evolving fuzzy 
rules can then capitalize on these data points and refine the 
estimate of opposition with perpetual incorporation of future 
data points. We tested the proposed algorithm with simple 
test functions, examined its evolving aspect, and investigated 
three widely used optimization benchmark functions as well to 
verify both the correctness and usefulness of type II opposites. 

We used the Takagi-Sugeno fuzzy systems in this work 
to leam the tme opposites. This approach does employ a t- 
norm and hence triggers axis-parallel mles. However, It has 
been clearly shown in recent literature that arbitrarily rotated 
mles are able to outperform the conventional axis-parallel ones 
(El. This is certainly a possibility for potential improvement 
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in future works. As well, in this work we only investigated the 
problems with one output. The extension to multiple output 
systems will certainly be of interest. 

The authors are investigating the benefits of the proposed 
approach for complex learning and optimization problems, and 
hope to report the results in foreseeable future. 
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